127 research outputs found

    BRANCHING NEURAL NETWORKS

    Get PDF
    A conditional deep learning model that learns specialized representations on a decision tree is described. Unlike similar methods taking a probabilistic mixture of experts (MoE) approach, a feature augmentation based method is used to jointly train all network and decision parameters using back–propagation, which allows for deterministic binary decisions at both training and test time, specializing subtrees exclusively to clusters of data. Feature augmentation involves combining intermediate representations with scores or confidences assigned to branches. Each representation is augmented with all of the scores assigned to the active branch on the computational path to encode the entire path information, which is essential for efficient training of decision functions. These networks are referred to as Branching Neural Networks (BNNs). As this is an approach that is orthogonal to many other neural network compression methods, such algorithms can be combined to achieve much higher compression rates and further speedups

    Learning to Navigate the Energy Landscape

    Full text link
    In this paper, we present a novel and efficient architecture for addressing computer vision problems that use `Analysis by Synthesis'. Analysis by synthesis involves the minimization of the reconstruction error which is typically a non-convex function of the latent target variables. State-of-the-art methods adopt a hybrid scheme where discriminatively trained predictors like Random Forests or Convolutional Neural Networks are used to initialize local search algorithms. While these methods have been shown to produce promising results, they often get stuck in local optima. Our method goes beyond the conventional hybrid architecture by not only proposing multiple accurate initial solutions but by also defining a navigational structure over the solution space that can be used for extremely efficient gradient-free local search. We demonstrate the efficacy of our approach on the challenging problem of RGB Camera Relocalization. To make the RGB camera relocalization problem particularly challenging, we introduce a new dataset of 3D environments which are significantly larger than those found in other publicly-available datasets. Our experiments reveal that the proposed method is able to achieve state-of-the-art camera relocalization results. We also demonstrate the generalizability of our approach on Hand Pose Estimation and Image Retrieval tasks

    A multimodal 3d healthcare communication system

    Get PDF
    W e p r e s e n t a system that integrates gesture recognition and 3D talking head technologies for a patient communication a p p l i c a t i o n a t a hospital or healthcare setting for supporting patients treated in bed. As a multimodal user interface, we get the input from patients using hand gestures and provide feedback by using a 3D talking avatar. Index Terms — gesture recognition, multimodal user interfaces, 3D facial animation

    Fragrant grapes on the trail of flavour as part of gastronomy tourism

    Get PDF
    Anadolu, binlerce yıllık geçmişiyle üzümün anavatanı ve gen merkezi konumundadır. Bu coğrafyada yaşayan her medeniyet üzümü ve üzümden elde ettikleri ürünleri mutfak kültürlerinin içine almışlar ve mutfak kültürlerini farklılaştırmışlardır. Bu farklılık ise her geçen gün turizm açısından daha fazla çekicilik unsuru olarak değerlendirilmektedir. Bu doğrultuda değişen turizm dinamikleri ile gastronomi turistleri lezzetlerin takibini yaparak seyahatlerini planlamaktadırlar. Bu çalışma ile kokulu üzümün, Karadeniz Bölgesi’nin Orta ve Doğu sahil şeridinde yer alan illerdeki yeri ve önemi ayrıca gastronomi turizmine yönelik bir ürün olarak potansiyeli ele alınmıştır. Gastronomi turizmine yönelik ilginin arttığı sonucuna bağlı olarak, alternatif bir gastronomik ürün rotası önermesi açısından çalışma önem arz etmektedir. Çalışmanın teorik alt yapısı planlı davranış teorisine dayandırılmıştır. Çalışmanın evrenini Karadeniz Bölgesi’nin Orta ve Doğu sahil şeridinde yer alan; Samsun, Ordu, Giresun, Trabzon, Rize ve Artvin illeri oluşturmaktadır. Çalışmanın verileri nitel araştırma yöntemlerinden doküman inceleme, gözlem ve görüşme tekniğiyle toplanmıştır. Çalışmanın deseni örnek olaydır. Çalışmada veri analizi planı oluşturulmuş, betimsel analiz yöntemi ile veriler analiz edilmiştir. Çalışma ile elde edilen veriler; kokulu üzümün bölge için değerlendirilmesi gereken bir potansiyele sahip olduğu sonucunu ortaya koymaktadır. Dolayısıyla kokulu üzüme yönelik gastronomi rotalarının oluşturulması ile destinasyon çekiciliği arttırılarak destinasyon imajının gelişmesine böylece yerel halkın yaşam kalitesinin yükseltilmesine önemli katkılar sağlanabilecektirAnatolia, with its thousands of years of history, is the homeland and gene center of grapes. Every civilization living in this geography has included grapes and products produced from them into their culinary cultures and differentiated their culinary cultures. This difference is considered more and more attractive in terms of tourism day by day. With the changing tourism dynamics, gastronomy tourists plan their travels by following flavors. In this study, the place and importance of fox grape in the provinces located on the Middle and Eastern coastline of the Black Sea region, and its potential as a product for gastronomy tourism were discussed. Depending on the result that the interest in gastronomic tourism has increased, the study is important in terms of proposing an alternative route of gastronomic product. The theoretical background of the study is based on the theory of planned behavior. The population of the study consists of Samsun, Ordu, Giresun, Trabzon, Rize and Artvin provinces located in the Middle and Eastern coastline of the Black Sea region. Data of the study were collected by document analysis, observation and interview techniques, which are among the qualitative research methods. The research was designed as a case study. In the study, a data analysis plan was created and the data were analyzed using the descriptive analysis method. The data obtained from the study reveal that fox grape has a potential that should be evaluated for the region. Therefore, the creation of gastronomic routes to fox grape can contribute to increasing the attractiveness of the destination and improving the image of the destination, and thus to increasing the local people’s quality of life

    Modelling and simulation studies on adaptive controller for alt-azimuth telescopes despite unknown wind disturbance and mass

    Get PDF
    Numerous ground-based observatories are using small sized ground telescopes for scientific research purposes. The telescopes that are available on the market have three main problems. These issues can be listed as: positioning repeatability, tuning requirement according to different wind speeds for different seasons, and the mass changing via different scientific equipments added to the telescope. This study is aimed at resolving these issues for ground based small alt-azimuth telescopes. Establishing of a set and forget system is performed by designing an adaptive controller. Motor dynamics are taken into consideration for a realistic mathematical model. The Wind-Gust model that consists of a sum of sinusoidal disturbances with unknown phase, amplitude and frequency is used for the wind model. The purposed controller cancels the disturbance effects on the plant while operational positioning and also the makes the plant insensitive to mass changes. The Lyapunov approach is utilised when proving the asymptotic stability. The proposed controller’s success is illustrated with thorough numerical evaluation.The authors would like to thank the technical guidance and funding support of Isik University, Center for Optomechatronics Research and Application (OPAM),and Ataturk University Center for Astrophysical Application and Research (ATASAM). The author(s) received no financial support for the research, authorship, and/or publication of this article.Publisher's Versio

    In-Hand 3D Object Scanning from an RGB Sequence

    Full text link
    We propose a method for in-hand 3D scanning of an unknown object with a monocular camera. Our method relies on a neural implicit surface representation that captures both the geometry and the appearance of the object, however, by contrast with most NeRF-based methods, we do not assume that the camera-object relative poses are known. Instead, we simultaneously optimize both the object shape and the pose trajectory. As direct optimization over all shape and pose parameters is prone to fail without coarse-level initialization, we propose an incremental approach that starts by splitting the sequence into carefully selected overlapping segments within which the optimization is likely to succeed. We reconstruct the object shape and track its poses independently within each segment, then merge all the segments before performing a global optimization. We show that our method is able to reconstruct the shape and color of both textured and challenging texture-less objects, outperforms classical methods that rely only on appearance features, and that its performance is close to recent methods that assume known camera poses.Comment: CVPR 202

    AssemblyHands: Towards Egocentric Activity Understanding via 3D Hand Pose Estimation

    Full text link
    We present AssemblyHands, a large-scale benchmark dataset with accurate 3D hand pose annotations, to facilitate the study of egocentric activities with challenging hand-object interactions. The dataset includes synchronized egocentric and exocentric images sampled from the recent Assembly101 dataset, in which participants assemble and disassemble take-apart toys. To obtain high-quality 3D hand pose annotations for the egocentric images, we develop an efficient pipeline, where we use an initial set of manual annotations to train a model to automatically annotate a much larger dataset. Our annotation model uses multi-view feature fusion and an iterative refinement scheme, and achieves an average keypoint error of 4.20 mm, which is 85% lower than the error of the original annotations in Assembly101. AssemblyHands provides 3.0M annotated images, including 490K egocentric images, making it the largest existing benchmark dataset for egocentric 3D hand pose estimation. Using this data, we develop a strong single-view baseline of 3D hand pose estimation from egocentric images. Furthermore, we design a novel action classification task to evaluate predicted 3D hand poses. Our study shows that having higher-quality hand poses directly improves the ability to recognize actions.Comment: CVPR 2023. Project page: https://assemblyhands.github.io
    corecore